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Abstract — The  problem  of  detecting  underwater  targets  from 
Electro-optical  (EO)  images  is  considered  in  this  paper.  A  block- 
based  log-likelihood  ratio  test  has  been  developed  for  detection 
and  segmentation  of  underwater  mine-like  objects  in  the  EO 
images  captured  with  a  CCD-based  image  sensor.  The  main  focus 
of  this  research  is  to  develop  a  robust  detection  algorithm  that 
can  be  used  to  detect  low  contrast  and  partial  underwater  objects 
from  the  EO  imagery  with  low  false  alarm  rate.  The  detection 
method  involves  identifying  frames  of  interest  (FOI)  containing 
the  potential  targets.  Once  the  FOI  have  been  identified,  regions 
of  interest  (ROI)  within  the  FOI  are  segmented  from  the  back¬ 
ground.  Performance  of  the  detection  method  is  tested  in  terms  of 
probability  of  detection,  false  alarm  rate,  and  receiver  operating 
characteristic  (ROC)  curves  for  FOI  in  the  selected  data  runs. 
The  algorithm  shows  promising  results  in  target  detection  and 
generation  of  good  silhouettes  for  subsequent  classification. 

Index  Terms — Binary  hypothesis  testing,  electro-optical  sen¬ 
sors,  underwater  target  detection 

I.  Introduction 

Automatic  detection  and  recognition  of  underwater  objects 
from  EO  imagery  poses  a  serious  challenge  due  to  poor 
environmental  and  operating  conditions  that  impair  the  quality 
of  the  captured  images.  Although  the  sensor  technology  for 
underwater  mine  identification  has  advanced  to  a  level  that 
these  systems  are  being  transitioned  into  the  fleet,  the  target 
identification  is  still  being  done  by  human  operators  [1].  The 
development  of  an  automatic  underwater  target  identification 
system  capable  of  identifying  various  types  of  underwater 
targets,  under  different  environmental  conditions  pose  many 
technical  problems.  Some  of  the  contributing  factors  are: 
Targets  have  diverse  sizes,  shapes  and  reflectivity  properties. 
Target  emplacement  environment  is  variable;  targets  may 
be  proud  or  partially  buried.  Environmental  properties  vary 
significantly  from  one  location  to  another.  In  particular,  the 
variation  in  the  turbidity  can  substantially  change  the  quality 
of  the  collected  data  and  hence  the  conspicuity  of  the  targets. 
Bottom  features  such  as  sand,  rocks,  corals,  and  vegetation 
can  conceal  a  target  whether  it  is  partially  buried  or  proud. 
Competing  clutter  with  responses  that  closely  resemble  those 
of  the  targets  may  lead  to  false  positives.  All  these  factors 
contribute  to  make  this  problem  a  very  complicated  one. 

Previous  work  on  EO  data  has  been  focused  on  Streak 
Tube  Imaging  Lidar  (STIL)  system  [1]— [4],  and  laser  line 


scan  (LLS)  [5]— [7]  based  systems.  STIL  sensor  produces  high- 
resolution  3-D  images  of  underwater  objects  by  scanning  (line 
by  line),  on  the  target  field  [1].  The  collected  raw  STIL 
data  is  rendered  to  produce  pairs  of  contrast  (gray-level) 
and  range  (distance)  maps.  The  previous  work  focuses  on 
filtering,  segmentation,  and  classification  of  underwater  mine¬ 
like  objects  from  pre-cropped  regions  of  the  STIL  scans. 

In  this  paper  we  present  the  development  of  a  block-based 
log-likelihood  detector  for  use  with  a  EO  imagery  database 
different  than  that  produced  by  the  STIL  EO  sensor.  This 
database  was  collected  using  a  new  CCD  EO  system  capable 
of  producing  ocean  bottom  snapshot  images.  The  new  CCD 
EO  system  is  contained  inside  the  Bluefin  12  underwater 
unmanned  vehicle  (UUV)  developed  by  Bluefin  Robotics 
Corp.  and  is  capable  of  capturing  subsequent  snapshots  over  a 
target  field  as  the  vehicle  is  moving.  The  proposed  detector  is 
capable  of  identifying  frames  of  interest  (FOI)  which  contain 
potential  targets  within  data  runs  produced  by  the  CCD  sensor, 
as  well  as  segmenting  the  regions  of  interest  (ROI)  from  the 
detected  frames.  An  important  benefit  of  the  proposed  detector 
lies  in  the  fact  that  detection  of  FOI  and  ROI  segmentation 
can  be  achieved  in  a  single  step.  FOI  detection  followed  by 
automatic  ROI  segmentation  will  reduce  the  number  of  objects 
from  which  features  need  to  be  extracted,  as  well  as  reduce 
the  overall  processing  load  on  the  classifier. 

This  paper  is  organized  as  follows:  In  Section  II  we  will 
discuss  the  CCD  sensor  and  its  properties.  Section  III  describes 
the  sensor  data  and  challenges  associated  with  detecting  ob¬ 
jects  from  this  EO  database.  Section  IV  discusses  the  design 
and  implementation  of  the  block-based  detector.  Section  V 
assesses  the  performance  of  the  detector  on  the  EO  database. 
Finally,  Section  VI  gives  conclusions  on  this  work  and  dis¬ 
cusses  future  work. 

II.  CCD  Sensor  Description  &  Properties 

This  section  provides  an  overview  of  the  data  collected  by 
the  CCD  system.  The  sensor  used  in  this  work  employs  a  DVC 
1500  monochrome  CCD  camera  coupled  with  a  Philips  Lu- 
miled  Luxeon  Flood  18  LED  illuminator  [8].  The  illuminator 
has  a  luminous  flux  >  500  lumens,  and  the  CCD  camera  is 
capable  of  producing  images  of  sizes  1394  x  1040  (6.45um 
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TABLE  I 

Tested  CCD  Data  Set 


Frame  N 


Frame  N+3 


Frame  N+1 


Frame  N+4 


Frame  N+2 


Frame  N+5 


Run 

Total  Frames 

Target  Frames 

SAM001_003 

42 

0 

SAM004_001 

35 

0 

SAM22_011 

35 

0 

SAM23_003 

293 

3 

SAM23_004 

287 

3 

SAM23_005 

293 

4 

Target  Y8_001 

136 

3 

TargetY8_003 

29 

0 

Target Y8_004 

32 

0 

Target  Y8 006 

135 

3 

Totals 

1317 

16 

x  104  Typical  Distribution  Target/Non-Target  Pixel  Intensities 


Pixel  Intensities 


Fig.  2.  Histograms  of  typical  Target  and  Non-Target  frames. 


Fig.  1.  A  sequence  of  frames  from  run  TargetY8_001  containing  target 
frames  (both  full  and  partial  targets). 

pixel  size)  with  gray  level  resolution  of  12-bit/pixel.  The 
camera  also  supports  multiple  binning  modes  (lxl  to  8x8)  [8]. 
An  example  of  a  frame  sequence  is  shown  in  Figure  1.  The 
focus  of  this  work  is  to  separate  the  frame(s)  of  interest  (FOI), 
which  contain  potential  targets  from  those  that  do  not  contain 
targets.  The  second  objective  is  to  segment  the  mine-like  object 
within  an  FOI  from  the  background  in  order  to  classify  it 
as  target  or  non-target.  As  mentioned  the  CCD  sensor  takes 
several  ocean  bottom  snapshots  in  a  run  making  the  detection 
process  different  from  the  previous  work  [1],  [2]-[4],  [8],  [9] 
on  EO  images. 

III.  CCD  Sensor  Data  &  Challenges 

The  CCD  image  data  consists  of  a  series  of  ocean  bottom 
snapshots  as  in  Figure  1 .  The  data  analyzed  in  this  study  con¬ 
sisted  of  five  data  runs  containing  targets,  and  five  containing 
no  targets  (just  background).  The  data  runs  used  together  with 
total  number  of  frames  per  run  and  target  FOI’s  are  given  in 
Table  I. 


As  can  be  seen  the  total  number  of  frames  in  the  data 
runs  is  1317,  of  which  16  frames  contain  targets.  For  these 
data  runs  the  CCD  system  was  set  to  produce  images  that 
are  684x513  pixels  at  12-bits  per  pixel  gray  level  resolution. 
The  data  imported  is  resized  to  512x512  pixels  for  ease 
of  computation  using  MATLAB  bicubic  interpolation  image 
resizing  algorithm.  Example  data  frames  containing  target 
(frames  N+3,  N+4,  N+5)  and  non-target  (frames  N,  N+1, 
N+2)  are  shown  in  Figure  1.  The  data  set  contains  only 
three  different  types  of  targets  namely  cylindrical,  circular, 
and  trapezoidal  targets. 

There  are  three  main  tasks  involved  in  designing  an  au¬ 
tomatic  target  detection  and  recognition  system  for  this  new 
EO  database.  The  first  is  FOI  detection,  which  is  the  key 
to  the  success  of  other  subsequent  processing  steps  namely 
feature  extraction  and  classification.  Since  only  a  few  out  of 
several  hundred  frames  in  a  run  may  contain  a  partial  or  full 
target  images,  it  is  important  to  isolate  only  those  frames 
which  contain  a  potential  target  from  those  of  background. 
The  next  task  is  segmentation  of  the  objects  within  the  FOI 
for  ROI  selection.  This  is  also  critical  due  to  the  fact  that 
background  and  mine-like  objects  tend  to  have  very  similar 
contrast  characteristics,  hence  making  the  segmentation  and 


discrimination  very  difficult  tasks.  The  third  task  involves 
designing  the  classification  system.  The  challenge  of  the 
detector  and  classifier  for  this  new  CCD  EO  database  lies  in 
the  fact  that  FOI  may  contain  partial  targets  (e.g.  see  frame 
N+3  and  N+5  in  Figure  1).  Partial  targets  cause  difficulties 
in  both  detection  and  classification  processes  due  to  the  fact 
that  the  extracted  ROI  may  not  contain  adequate  information. 
The  challenges  and  issues  in  these  tasks  are  discussed  in  more 
detail  below. 

1)  As  mentioned  before  each  data  run  contains  a  large 
number  of  frames  containing  only  background  and  few 
frames  containing  targets.  The  focus  of  this  work  is  to 
detect  FOI  within  the  runs,  and  extract  ROI  only  from 
the  detected  frames  containing  potential  targets.  Once 
ROI  are  extracted  from  the  FOI,  the  problem  becomes 
a  two-class  classification  problem  to  determine  if  the 
detected  object  is  a  target  or  a  non-target.  The  main 
challenge  involves  designing  a  detector  that  will  provide 
screening  mechanism  to  filter  out  any  frames  that  have 
no  object  of  interest.  If  a  target  exists  in  a  frame,  the 
frame  must  be  marked  as  FOI,  so  that  the  detected 
objects  contained  in  the  FOI  can  be  classified. 

2)  The  next  main  challenge  when  designing  a  detection 
system  for  this  new  EO  database  is  successful  ROI 
segmentation.  We  can  see  from  typical  target  and  back¬ 
ground  frames  in  Figure  1  and  histogram  of  example 
frames  (Target  N+4  and  Non-target  N+l)  in  Figure  2 
that  the  background  and  target  have  overlapping  gray 
level  intensities.  This  makes  it  difficult  to  employ  global- 
based  schemes  to  segment  the  detected  objects.  Also  the 
CCD  EO  images  do  not  provide  any  identifiable  texture 
to  allow  discrimination  between  target  and  background. 

3)  Finally  partial  targets  are  fragmented  ROI  within  a  FOI. 
This  can  occur  because  of  occlusion  or  when  only  a 
portion  of  a  target  is  captured  in  a  frame,  hence  causing 
two  problems.  The  first  problem  is  the  fact  that  a  partial 
target  may  be  very  small  and  indistinguishable  from 
background  anomalies.  Small  objects  pose  a  challenge 
since  the  detector  must  have  some  way  of  discriminating 
small  anomalies  from  very  small  portions  of  targets. 
Another  issue  involved  in  partial  targets  exists  in  the 
fact  that  these  small  ROI  must  be  classified  after  they  are 
detected.  A  classifier  may  incorrectly  classify  a  partial 
target  due  to  the  lack  of  discriminatory  information  of 
the  mine-like  object. 

IV.  Block-Based  Detection 

In  this  section  the  proposed  block-based  method  for  detec¬ 
tion  of  FOI  within  a  run,  and  determining  ROI  within  the 
detected  frames  is  described.  The  main  reason  for  taking  a 
local-based  (block)  approach  as  opposed  to  a  global-based 
approach  employed  in  [1],  [5],  [10]  lies  in  the  fact  that  FOI 
must  be  determined  for  every  data  run.  If  a  histogram  (global- 
based)  approach  were  to  be  employed  here  preprocessing  and 
segmentation  would  be  performed  on  every  frame  in  the  data 
set.  However,  as  mentioned  before  in  the  CCD-based  database 


mine-like  objects  tend  to  have  very  similar  pixel  intensity 
as  those  of  background  regions,  hence  making  global-based 
methods  inefficient  for  this  application.  In  contrast,  in  the 
local  block-based  approach  each  image  is  processed  block- 
by-block  using  a  local-based  Gauss-Gauss  detector  [11],  [12] 
which  exploits  local  statistical  properties  of  the  target  and  non¬ 
target  blocks. 

Only  blocks  that  have  similar  characteristics  to  targets  are 
flagged  as  detections.  Once  all  blocks  within  a  given  frame 
are  processed  then  a  collection  of  connected  blocks  will 
be  defined.  Conceivably  this  method  should  identify  all  the 
blocks  in  a  given  frame  that  belong  to  a  potential  target.  This 
collection  of  connected  blocks  will  result  in  a  segmented  target 
from  which  features  will  be  extracted.  The  proposed  local- 
based  method  accomplishes  two  goals:  (1)  determines  if  an 
object  (or  part  of  an  object)  exists  in  a  frame  thereby  detecting 
a  FOI  (This  reduces  the  number  of  frames  which  need  to  be 
looked  at  by  the  classifier);  and  (2)  automatically  gives  the 
location  of  the  potential  target  by  segmenting  the  ROI  with 
mine-like  characteristics  from  the  FOI.  In  what  follows,  we 
describe  the  theory  and  results  of  this  block-based  detector. 

A.  Review  of  Binary  Hypothesis  Testing 

The  classical  detection  problem  of  choosing  between  two 
hypotheses  [11]  is  that  given  an  N-dimensional  observation 
space,  where  x  =  [xi,  £2,  •  •  •  ,  %n]H  represents  an  observation 
vector  in  this  space,  we  would  like  to  test  between  Hi 
hypothesis  (true)  and  Ho  hypothesis  (null)  for  this  observation 
vector.  Clearly,  each  time  we  conduct  the  test  there  are  four 
possible  outcomes  that  are:  (a)  Ho  is  true  and  we  choose  Ho, 
(b)  Hq  is  true  and  we  choose  Hi,  (c)  Hi  is  true  and  we 
choose  Hi,  and  (d)  Hi  is  true  but  we  choose  Ho.  The  first 
and  third  outcomes  lead  to  correct  decisions  while  the  second 
and  fourth  outcomes  lead  to  erroneous  decisions.  The  Bayes 
test  is  based  on  two  assumptions.  First,  the  two  hypotheses, 
Ho  and  Hi,  correspond  to  two  possible  prior  probabilities, 
Po  and  Pi,  respectively.  These  probabilities  represent  the 
prior  observer’s  information  about  the  hypotheses  before  the 
detection  is  conducted.  The  second  assumption  is  that  there 
is  a  cost  associated  with  each  of  the  four  courses  of  action 
described  above.  These  costs  are  denoted  by,  Coo,  Cio,  Cii, 
and  Coi,  for  outcomes  1-4,  respectively.  It  is  assumed  that 
the  cost  of  a  wrong  decision  is  higher  than  the  cost  of  a 
correct  decision,  i.e.  Cio  >  Coo  and  Coi  >  Cn.  The  goal 
of  the  Bayes  test  is  to  design  a  decision  rule  so  that  on  the 
average  cost  of  a  decision  will  be  as  small  as  possible,  which 
subsequently  leads  to  the  smallest  Bayesian  risk  when  making 
the  decision. 

Because  the  decision  rule  is  binary,  i.e.  there  are  only  two 
possibilities,  either  Ho  and  Hi,  we  can  view  the  rules  as  a 
division  in  the  observation  space  into  two  parts  A0  and  Ai. 
In  other  words,  if  the  observation  is  found  in  the  region  Ao 
the  hypothesis  H0  is  declared  true  and  if  the  observation  is 
found  in  the  region  A\  the  hypothesis  Hi  is  declared  true.  By 
viewing  the  problem  in  this  manner  we  express  [11]  the  risk 


7Z  in  terms  of  the  decision  regions  and  probabilities  as, 

n  =  Coo-Po  /  Px\ho(,x\H0)  dx 

Ja0 

+  CwPo  /  Px\ho(*\H0)  dx 

J  At 

+  CuPi  I  Px\h1(^\Hi)  dx 
J  At 

+  C01P1  [  px\Ht(x\Hi)  dx.  (1) 

JAo 

where  pX|#-(x|i^)  is  the  conditional  probability  density  of 
observation  given  hypothesis  Hi,  i  =  0, 1.  To  find  the  decision 
rule,  the  decision  regions  are  determined  such  that  the  risk  in 
(1)  is  minimized.  Because  each  element  of  x  must  be  assigned 
to  either  the  Ao  or  Ai  in  the  observation  space  A,  we  can 
say  that  A  =  A0  U  A1  and  A0  D  A1  =  0.  Now,  if  we  use 
fAPx\H0(x\Ho)  dx  =  Ia  Px\hM\H-[  )  d,x  =  1,  then  (1)  can 
be  rewritten  [11]  as, 


1Z  —  P0C10  +  PiCn 

*  [  [Pi(Coi-Cu)p^  (xl#i) 

JA0 

-  Po(Cw  -  C00)px\h0(.x\ho)}  dx  (2) 


The  first  two  terms  in  (2)  represent  the  fixed  cost  and  the 
integral  represents  the  cost  controlled  by  the  points  in  the 
observation  space,  A  that  are  assigned  to  Ao.  The  points  in  A 
for  which  the  first  term  in  the  integral  which  are  larger  than  the 
second  term  are  assigned  to  Ai,  whereas  the  points  in  which 
the  second  term  is  larger  than  the  first  term  are  assigned  to 
Aq.  Any  points  in  which  the  terms  are  equal  have  no  effect 
on  the  cost  and  can  be  arbitrarily  assigned  to  any  region  (we 
assume  that  the  points  are  assigned  to  Ai).  We  can,  therefore, 
define  the  decision  region  in  the  observation  space  by 

Px|gl(x|gi)  Hi  Po{Cw  -  Coo ) 
px|ffo(*|tfo)  Ho  Pi(C01-Cn)' 


The  quantity  on  the  left  is  called  the  likelihood  ratio  and  will 
be  denoted  by 


u  o  A  PX|gi  (xlgl) 
Px\Hq  (x|Po) 


(4) 


The  relationship  on  the  right  is  the  threshold  of  the  test 
and  will  be  denoted  by  rj.  Thus,  Bayes  criterion  leads  to  a 
likelihood  ratio  test, 

l(x)  ^  T).  (5) 

Ho 


Our  proposed  method  for  hypothesis  testing  is  based  on 
the  Neyman-Pearson  criterion  [11],  in  which  the  hypothesis 
test  is  formulated  as  a  constrained  optimization  problem. 
In  this  optimization  problem  the  false  alarm  probability  is 
constrained  and  the  probability  of  detection  is  maximized. 
The  optimization  problem  yields  a  likelihood  ratio  test  and 
thresholding  conditions.  The  Neyman-Pearson  criterion  [11], 
[13]  generates  a  test  to  maximize  Pd  (probability  of  detection) 
while  making  Pfa  (probability  of  false  alarm)  as  small  as 


possible.  The  criterion  constrains  Pfa  =  a'  <  a  and  designs 
a  test  that  maximizes  the  probability  of  detection  under  this 
constraint  [11]. 

We  applied  a  block-based  likelihood  ratio  test  using  the 
standard  Gauss-Gauss  detector  [12]  which  is  used  to  determine 
if  a  block  belongs  to  a  potential  target  or  just  background. 
The  detection  problem  is  viewed  in  terms  of  the  signal  plus 
noise  model  [12],  [13],  the  decision  between  two  hypotheses 
is  now  either  background  (noise)  only  (Ho)  or  target  (signal) 
plus  background  (Hi).  Assuming  that  observation  block  of 
size  NxN  shaped  column-wise  into  a  vector  x  £  is 

Gaussian  distributed  with  zero-mean  and  covariance  matrix  R. 
We  test  the  hypothesis  H0  :  R  =  Ro,  he.  noise  alone  versus 
Hi  :  R  =  Ri,  i.e  signal  plus  noise  where  Ri  =  Ro  A  Rs, 
Ro  is  the  covariance  matrix  of  the  noise  alone,  and  Rs  is  the 
covariance  matrix  of  the  target  alone.  It  is  assumed  that  noise 
and  target  are  uncorrelated.  Thus,  the  conditional  probability 
density  function  for  a  given  hypothesis  Hi ,  i  £  [0, 1]  and  a 
given  observation  vector  x  is  given  by 

PxwMHi)  =  (6) 

where  pX|#i(xl#i)  is  the  conditional  probability  of  x  given 

Hi,ie[  o,i]. 

Using  the  likelihood  ratio  in  (4)  and  taking  the  natural  log, 
the  log-likelihood  of  x  becomes  [12]: 


i(x)  = 

=  In 

Vl^ol-i  ) 

=  \ln\Po\  ~  ^InlR^  +  ^xH  (Rg1  -  RI1)  x 

Disregarding  the  constants  that  are  not  observation  dependent, 
the  likelihood-ratio  for  the  Gauss-Gauss  detector  [12]  becomes 

l(x)  =  xH  (i?o  1  -  Ri1)  X.  (7) 

=  xhQx. 

Where  Q  =  R^1  —  /?£  Defining 

y  =  i?o1/2x  (8) 

C  _  7 1/2  TD  TD~ T /2 

O  —  -TLq  1  7  1q 

Where  S  is  the  ’ Signal-to-noise  ratio’  matrix  [12],  the  log- 
likelihood  in  8  ratio  now  takes  the  form: 

l(x)  =  yT (I  -  S-1) y  (9) 

Now,  if  we  express  S  in  terms  of  it’s  orthogonal  decom¬ 
position  S  =  UAUT  where  A  =  diag[\i...\]sr]  contains  the 
eigenvalues  of  S,  and  U  contain  the  associated  eigenvectors. 
The  log-likelihood  ratio  becomes 

l(x)=yTU(I-\-1)UTy 


(10) 


If  Ai  >  A2  >  ...  >  Ar  >  Ar+i  >  ...  >  Atv,  then  the  reduced 
rank  version  of  (10)  can  be  written  as 

lr(x)  =  yTU(Ir  -  A.~1)UTy  (11) 

Where  Ar  and  Ir  are  the  reduced  rank  version  of  I  and  A-1 
containing  r  non-zero  entries.  Using  this  rank  r  log-likelihood 
ratio,  (for  r  =  1)  is  implemented  for  each  block  to  determine 
if  the  block  belongs  to  a  target. 

B.  Detector  Design 

In  order  to  use  the  proposed  block-based  likelihood  detector 
a  training  set  must  be  used  to  design  the  detector.  The  selection 
of  a  set  of  blocks  from  targets  and  background  is  required.  This 
process  is  subjective  in  that  the  blocks  used  for  the  training 
must  be  specifically  picked  from  frames  which  are  believed 
to  represent  a  wide  range  of  targets.  Since  a  limited  number 
of  frames  containing  targets  are  available  in  this  database, 
blocks  from  a  two  data  runs  (TargetY8_001  &  SAM002_008) 
containing  targets  were  used.  In  order  to  form  the  training 
set,  regions  of  target  and  background  were  cropped  from  the 
frames  shown  in  Figure  3. 


TargetY8_001  Frame  71  SAM002_008  Frame  150 


Fig.  3.  FOI  from  data  runs  Target Y8_001  &  SAM002_008  used  to  train  the 
Gauss-Gauss  detector.  Regions  of  blocks  were  selected  over  the  target  and 
background  regions. 


More  specifically,  two  sets  each  containing  1465  blocks 
each  (2930  blocks  total)  were  selected  from  the  two  data 
runs  TargetY8_001  &  SAM002_008  for  target  and  non-target 
regions  to  form  covariance  matrices  R\  and  Rq,  respectively. 
These  were  selected  so  that  they  only  contain  target  or  non¬ 
target  pixels,  and  boundary  regions.  The  covariance  matrices 
Ri  and  Ro  are  then  formed  by  rearranging  the  blocks  into 
vectors  and  computing  the  sample  covariance  matrices.  After 
training  the  detector  was  evaluated  on  the  entire  data  set  in 
Table  I  in  order  to  evaluate  the  performance  for  FOI  detection. 

A  log-likelihood  threshold  needs  to  be  chosen  in  order 
to  differentiate  between  background  and  target  blocks.  This 
threshold  value  was  experimentally  chosen  to  be  5  based  on 


Fig.  4.  Block-based  detection  process. 


analyzing  several  different  blocks  over  both  target  and  non¬ 
target  regions.  It  turns  out  that  this  threshold  is  adequate  for 
most  of  the  data  sets. 

C.  Implementation  of  Block-based  Detection 

Each  frame  in  a  data  run  is  partitioned  into  blocks  of  size 
4x4  pixels.  Each  block  is  then  rearranged  into  a  vector  for 
computing  the  log-likelihood  ratio.  An  exaggerated  example 
of  the  blocking  is  shown  in  Figure  4. 

The  Gauss-Gauss  detection  is  then  performed  on  each 
block,  and  a  likelihood  value  is  computed.  This  generates  a 
‘likelihood  map’  image.  In  this  likelihood  map  image  each 
pixel  represents  the  value  of  the  log-likelihood  ratio  of  the 
corresponding  block  in  the  EO  image.  The  likelihood  maps  are 
then  used  to  determine  both  FOI  and  ROI  in  a  data  set  based 
on  thresholding  the  log-likelihood  ratio.  This  log-likelihood 
ratio  thresholding  is  based  on  the  training  frames  from  runs 
TargetY8_001  &  SAM002_008  . 

At  least  25  connected  blocks  must  be  detected  in  order 
for  the  frame  to  be  flagged  as  a  FOI.  Also  an  upper  size 
threshold  on  the  number  of  connected  blocks  may  be  imposed. 
If  too  many  connected  blocks  are  detected  then  the  frame  is 
assumed  to  contain  only  background.  These  size  constraints 
pose  another  challenge  to  the  FOI  detector.  If  the  vehicle 
carrying  the  sensor  is  high  above  the  targets,  the  targets 
may  appear  small,  and  may  be  missed  due  to  the  lower  size 
threshold,  conversely  if  the  vehicle  is  directly  over  the  target 
then  the  target  may  appear  too  large.  The  size  constraints 
were  determined  experimentally  using  the  different  targets  in 
the  database  to  be  25  to  2500  blocks.  The  overall  process  is 
described  in  the  following  steps: 

1)  Apply  the  likelihood  ratio  at  each  block  using  the  Gauss- 
Gauss  formulation  in  (5)  and  (8). 

2)  If  the  block’s  likelihood  ratio  falls  below  the  pre¬ 
specified  threshold  then  designate  the  particular  block  as 
’target’  otherwise  the  block  is  declared  as  a  background 
block. 

3)  Determine  the  number  of  connected  blocks  and  impose 
the  minimum  and  maximum  number  constraints  and  flag 
the  frame  under  consideration  as  a  FOI  if  all  required 
conditions  are  met. 


FOI  ROC  Curve  Likelihood  ratio  test 


Fig.  5.  FOI  ROC  curve  for  the  data  set  shown  in  Table  I  using  4x4  blocks. 


V.  Detection  Results 

Using  the  log-likelihood  ratio  test  for  FOI  detection  the 
receiver  operating  characteristic  (ROC)  curve  was  generated 
for  the  data  set  in  Table  I.  The  ROC  shown  in  Figure  5 
represents  the  FOI  detection  results,  which  show  how  many 
frames  are  detected  as  false  alarm  compared  to  actual  targets. 
This  ROC  is  based  on  ‘object’  detection,  i.e.  if  an  object  of 
25  or  more  connected  blocks  is  detected  the  frame  is  flagged 
as  a  detected  frame  (or  FOI).  This  ROC  gives  an  indication 
of  the  number  of  FOI  detected  per  run.  This  ROC  also  give 
an  indication  of  how  many  frames  are  pre-filtered  or  removed 
for  the  classification  stage.  Once  a  frame  is  flagged  as  FOI, 
the  ROI  of  the  mine-like  object  is  segmented  for  feature 
extraction  by  way  of  the  detected  connected  blocks.  Higher 
false  alarm  rates  during  FOI  detection  requires  the  overall 
system  to  perform  unnecessary  feature  extraction  and  slows 
down  the  classification  on  the  entire  run.  The  block-based 
log-likelihood  scheme  was  evaluated  on  the  data  set  for  4x4 
blocks  for  FOI  detection.  At  the  knee  point  of  ROC  where 
Pd  +  Pfa  =  1,  we  have  Pd  ~  65%  and  Pfa  &  35%.  This 
result  may  appear  disappointing  but  it  shows  that  for  Pd  =  1 
all  frames  containing  potential  objects  are  detected,  while  on 
average  half  the  frames  from  the  run  are  eliminated  from  the 
classification  stage.  Eliminating  frames  from  the  classification 
stage  will  ultimately  reduce  the  load  on  the  classifier.  Several 
detection  results  based  upon  4x4  block  size  detector  are  shown 
in  Figures  6  (a)-(c).  The  three  cases  shown  correspond  to  an 
easy,  a  medium,  and  a  difficult  target  case  respectively.  The 
examples  are  given  for  the  pre- specified  threshold  value  of 
5,  which  corresponds  to  Pd  =  72%  and  Pfa  =  39%  on  the 
ROC  curve.  It  can  be  seen  from  Figure  6(b)  that  the  target 
is  not  completely  segmented.  This  may  cause  problems  when 
extracting  features  and  trying  to  classify  the  ROI  as  target  or 
non-target  based  upon  the  silhouette. 

Figures  7  (a)-(c)  show  examples  of  false  alarms  when 


detecting  FOI.  False  alarms  shown  include  camera  lens  scratch 
(Figure  7(a)),  and  sea  bottom  features  (Figures  7(b)  and  7(c)). 
The  false  alarms  generated  in  these  data  sets  tend  to  be  less 
solid  and  have  less  regularity  in  their  shape.  Therefore,  it  may 
be  possible  to  reduce  the  false  alarm  by  imposing  a  regularity 
criterion. 

VI.  Conclusions  &  Future  Work 

This  paper  described  and  analyzed  a  block-based  detector 
for  detecting  underwater  targets.  Using  the  block-based  detec¬ 
tor  the  log-likelihood  ratio  is  computed  for  each  block  within 
all  frames  in  a  run.  Overall  a  detection  rate  of  100%  can  be 
achieved  at  the  cost  of  a  false  alarm  rate  of  50%  over  all  runs 
using  this  block-based  detection  method. 

The  main  challenge,  however,  is  that  a  good  ROI  must  be 
generated  before  feature  extraction,  in  addition  to  reducing 
the  number  of  false  alarm  rate.  Additionally,  detection  of 
partial  targets  and  classification  based  on  partial  silhouettes 
may  also  pose  major  challenges.  An  important  benefit  of  the 
proposed  detector  lies  in  the  fact  that  detection  of  FOI  and 
ROI  segmentation  can  be  achieved  in  a  single  step.  This 
FOI  detection  coupled  with  automatic  ROI  segmentation  will 
reduce  the  number  of  objects  from  which  features  need  to  be 
extracted,  as  well  as  reducing  the  overall  load  on  the  classifier. 
Although  FOI  detection  and  automatic  ROI  segmentation  are 
desirable  benefits  further  work  is  needed  in  order  to  fine- 
tune  and  improve  the  ROI  segmentation.  More  specifically, 
a  robust  object  silhouette  definition  needs  to  be  developed 
while  reducing  the  number  of  objects  extracted  from  each 
frame.  This  is  the  key  to  successful  object  classification 
and  identification.  The  algorithms  developed  in  this  research 
provided  good  results  for  underwater  target  detection  from 
CCD  EO  imagery.  Among  the  desirable  characteristic  of  the 
proposed  methods  is  also  the  simplicity  of  the  algorithm  for 
detecting  FOI  in  a  run,  while  automatically  segmenting  the 
ROI  of  mine-like  objects.  Unlike  the  methods  in  [1],  no 
preprocessing  is  needed  here  and  one  algorithm  is  used  for 
both  FOI  identification  and  ROI  extraction. 

Future  work  includes  using  Zernike  moments  [10],  [14], 
[15]  as  shape  dependent  features  for  classification  of  the 
detected  objects  in  this  data  set.  Work  is  devoted  to  extrac¬ 
tion  of  shape  dependent  feature  from  segmented  ROI.  These 
Zernike  moments  will  be  used  as  object  features  due  to  their 
rotational  and  translational  invariance  as  well  as  robustness 
to  noise.  Further  work  will  also  be  devoted  to  development 
of  a  GUI  application  for  training  and  using  the  detection  and 
classification  systems. 
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Fig.  6.  (a)  Detection  example  ’easy  target’  TargetY8_001.  Left:  FOI, 

Right:  detection  result,  (b)  Detection  example  ’medium  target’  SAM23_005. 
Left:  FOI,  Right:  detection  result  and  (c)  Detection  example  ’difficult  target’ 
SAM23_003.  Left:  FOI,  Right:  detection  result. 


(c)  Ocean  bottom  features 

Fig.  7.  (a)  Detection  false  alarm  example  camera  lens  scratch  SAM23_003. 

Left:  FOI,  Right:  detection  result,  (b)  Detection  false  alarm  example  sea 
bottom  SAM001_003.  Left:  FOI,  Right:  detection  result  and  (c)  Detection 
false  alarm  example  sea  bottom  TargetY8_003.  Left:  FOI,  Right:  detection 
result. 
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